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Summary  of  Most  Important  Results 

We  have  proposed  to  do  research  In  four  related  areas:  design  of  parallel 
algorithms,  design  of  efficient  VLSI  circuits,  study  of  movement  In  distributed 
computations  and  the  use  of  randomness  In  parallel  computations. 

We  have  been  quite  successful:  the  project  has  yielded  well  over  30  publi¬ 
cations,  two  completed  Ph.D.  dissertations  (all  recelplents  are  currently  hold¬ 
ing  academic  positions,  one  at  University  of  Southern  California,  and  one  at 
Harvard  University),  several  MS  papers,  and  three  additional  Ph  D  disserta¬ 
tions  In  final  phase  of  completion.  We  discuss  the  scientific  accomplishments 
of  some  of  the  papers  below.  It  may  be  significant  to  note  that  the  results  are 
not  only  appearing  In  prestigious  Journals  (SIAM  J.  COMPT.,  JACM,  Journal 
of  Computer  and  Systems  Sciences,  IEEE  Transactions  on  Computers)  but 
they  have  been  presented  at  very  selective  conferences  (like  the  MIT  VLSI 
Conference  (1  paper)  the  IEEE  Foundations  of  Computer  Science  (5  papers) 
and  the  ACM  Principles  of  Programming  Languages  Conference  (1  paper)  ). 
Usually  only  one  of  4  or  5  submissions  are  accepted  at  these  conferences,  and 
the  submitted  papers  are  generally  of  very  high  quality. 

On  the  more  applied  end  of  things,  as  a  result  of  ARO  support  there  are 
now  active  VLSI  design  facilities  at  Penn  State.  Several  chips  have  been 
designed  and  fabricated.  In  addition,  considerable  progress  has  been  made  In 
research  about  synthesis  of  VLSI  circuits  and  CAD  tools  for  VLSI  design.  At 
university  of  Maryland,  J.  Ja’Ja'  and  his  students  have  developed  a  new 
methodology  to  compile  logic  that  does  not  use  the  PLA  approach.  At  Penn 
state.  Larry  Jones,  a  Ph.D.  student  supported  by  the  contract,  and  J.  Simon 
have  di-veloped  on  Incremental  hierarchical  circuit  editor  based  on  the  precise 


and  rigorous  foundations  of  attribute  grammars. 

We  believe  that  our  main  scientific  achievements  have  been  the  following: 

Theory  oj  Communication  Complexity:  Together  with  our  students  V.K.  Pra- 
sanna  Kumar  and  R.  Paturl,  we  have  contributed  significantly  to  this  n^w 
theory.  Our  reported  results  characterize  precisely  the  amount  of  Information 
that  needs  to  be  exchanged  between  two  computing  agents  In  order  to  com¬ 
pute  a  given  function.  We  have  extended  the  theory  to  Interesting  new  models 
of  computation  and  obtained  elegant  mathematical  charecterlzatlons.  Our 
work  Is  extensively  referenced  and  has  Inspired  new  theoretical  developments. 

Lower  Bounds  for  VLSI  Computations:  Using  Information  transfer  arguments 
(and  In  many  cases,  developing  new  powerful  methods  to  account  for  Informa¬ 
tion  flow),  we  have  been  able  to  prove  minimal  requirements  on  the  area  and 
the  time  of  \1.SI  chips  to  compute  a  large  number  of  problems.  Some  of  the 
lower  bounds  are  general,  while  many  are  applicable  to  Interesting  problems 
like  graph  connectivity,  sorting  and  problems  In  signal  processing.  Some  of 
our  earlier  work  tia.s  heeii  Included  In  I 'liman’s  text  on  \1jSI. 

Algorithms  and  Sp>  ciiil-rurjiose  Architectures  for  VLSI  We  have  developed 
novel  arcMlteei  nres  Pascd  on  recently  discovered  fast  algorithms  to  solve 
several  basic  protil  ni"  In  >1loi  i|  i-rocesslng.  ()ur  approach  will  result  In  fully 
plpellne<|  bit  serial  arcliltcci  iires  w  hlidi  recpjlre  no  control  units.  We  have 
shown  that  the  aera  Is  about  tin-  tnlnlmitm  possible  and  the  overall  delay  Is 
within  an  optimal  order  of  magnitude.  An  essential  Ingredient  of  these  Imple¬ 
mentations  Is  the  use  of  digit  otillne  adder  and  multiplier  cells.  Several  chips 
have  been  fabricated  for  the  DI’T  processor.  Other  type  of  archltecttires  we 


4 


considered  uses  serial  memories  (such  as  those  appearing  In  magnetic  disks, 
magnetic  bubble,  charge  coupled  devices,  and  even  VLSI  shift  registers)  that 
have  a  high  Information  density  and  low  cost.  We  have  developed  optimal 
algorithms  for  sorting,  computing  the  discrete  Fourier  transform  and  matrix 
operatlons.On  the  other  hand,  optimal  algorithms  for  several  graph  problems 
were  obtained  by  one  of  our  students  (S.  Hambrusch). 

Theory  of  Computation:  Although  not  a  main  focus  of  research,  the  techniques 
used  In  some  of  our  proofs  yielded  excellent  results  In  nearby  areas  of  theoreti¬ 
cal  computer  science.  For  example,  arguments  used  for  Information  transfer 
(Kolmogorov  complexity)  gave  powerful  lower  bounds  on  the  time  required  for 
certain  on-line  computations.  Similarly ,very  sharp  lower  bounds  have  been 
obtained  for  certain  probabilistic  space-bounded  computations  and  restricted 
ways  of  computing  the  permanent. 

CAD  Tools:  A  new  approach  for  compiling  logic  has  been  developed  by  Ja'Ja' 
and  his  students.  The  standard  approach  Is  usually  based  mostly  on  using 
PLA’s  and  sometimes  Weinberger  Arrays.  Our  method  tries  to  exploit  all  the 
symmetries  Inherent  In  the  given  functions  and  forms  a  layout  that  Is  espe¬ 
cially  tailored  for  symmetric  or  partially  symmetric  functions.  We  can 
rigorously  show  that  If  the  given  functions  have  a  reasonable  degree  of  sym¬ 
metry,  then  our  method  will  be  considerably  better  that  any  of  the  known 
methods,  in  ca.se  no  symmetry  Is  detected,  some  transformation  are  applied 
and  the  resulting  functions  are  guaranteed  to  have  a  certain  degree  of  sym¬ 
metry.  We  can  show  that  we  will  always  end  up  with  a  layout  that  Is  at  least 
as  good  as  the  one  produced  by  the  PLA  approach.  A  new  system 
SYMmetrlc  lloolean  Layout,  has  been  built  and  we  are  In  the  process  of 


experimenting  with  It. 


Larry  Jones  and  J.  Simon  are  In  the  final  stages  of  Implementing  a  struc¬ 
tured  circuit  editor  based  on  attribute  grammars.  The  system  will  maintain 
dynamically  properties  of  the  circuit  being  designed.  When  a  subcircuit  Is 
redesigned  only  the  changes  caused  by  the  substitution  will  have  to  be  recom¬ 
puted.  The  selection  of  what  to  recompute  and  In  what  order  Is  done  fairly 
optimally  by  the  system.  The  algorithms,  an  extension  of  similar  techniques 
used  for  Incremental  grammar-  based  editors,  are  new.  We  believe  that  such 
systems  are  going  to  be  practical,  and  will  enable  the  construction  of  efficient 
"expert"  editors  for  VLSI  designs.  As  an  Illustration  of  the  power  of  the  for¬ 
malism,  It  was  possible  to  add  to  the  sytem  a  version  of  the  MOSSIMII  circuit 
simulator  with  very  little  effort.  While  the  Implementation  Is  very  slow.  It 
took  only  few  days  of  programming  to  bring  It  up.  A  preliminary  report  of 
this  work  was  given  at  the  Principles  of  Programming  Languages  Conference 
this  January. 

In  addition  to  the  major  pieces  of  work  discussed  above,  several  of  our 
students  have  done  work  on  VXSI  design  tools  or  designed  small  chips.  There 
was  work  on  F’L.V  generators,  Weinberger  Array  Generators,  design  of  adders 
as  well  ivs  a  large  number  of  MS  papers  that  although  not  supported  by  the 
contract,  would  not  have  been  possible  without  the  Initial  Impetus  to  VLSI 
design  at  Penn  State  given  by  the  contract. 

Parallel  Computation:  Much  research  effort  has  been  expanded  Into  mapping 
algorithms  designed  for  not  realistic  general  purpose  computers  Into  models 
that  can  In  fact  be  realized,  either  with  present  technology,  or  with  technol- 


ogy  that  should  be  available  within  a  decade.  We  mention  the  following 
results.  With  Y.  Chang  (a  graduate  student  supported  by  the  project)  and  S. 
Hambruch,  we  Investigated  the  feasibility  of  mapping  parallel  algorithms  with 


given  submodules  Into  systolic  arrays.  Unfortunately  we  showed  that  except  In 
very  simple  cases  such  attempts  are  bound  to  fall:  even  the  problem  of  decid¬ 
ing  whether  a  given  module  Is  feasible  Is  already  NP-complete.  Another  major 
result,  obtained  recently  by  Y  Chang  and  J  Simon,  concerns  routing  of  mes¬ 
sages  on  the  hypercube.  We  show  that  It  Is  possible  to  pipeline  routing 
Instances  efllclently.  More  precisely.  It  has  been  known  that  It  Is  possible  to 
I  route  probabilistically  on  an  n-dlmenslonal  hypercube  In  time  0(n).  We  show 

that  this  Is  true  even  If  one  starts  a  new  routing  Instance  every  c  units  of 
time.  More  precisely,  there  are  constants  c  and  d  such  that  Is  a  new  batch  of 
messages  appears  at  times  Ic  for  1=0, 1,2,....  then  the  messages  will  reach  their 
destination  by  time  Ic-l-dn  with  overwhelming  probability.  It  was  not  known 
previously  that  n  batches  could  be  routed  In  less  than  0{n‘^)  time,  nor  that 
the  probabilistic  scheme  could  be  repeated  Indefinitely  without  serious  degra¬ 
dation.  Since  several  hypercube  mulllcomputers  have  been  built  recently,  such 
schemes  have  more  than  academic  Interest. 

Fault-Tolerant  Distributed  Computation:  Massively  parallel  computers  will 
have  falling  components.  Our  research  has  dealt  with  the  problem  In  two 
Important  ways: 

a)  the  reduced  hardware  architecture  proposed  by  Ja’Ja’  and  Ow^ens  Is 
well  suited  for  graceful  degradation  In  the  presence  of  certain  classes  of  pro¬ 


cessor  failure. 

b)  \'ery  recent  work  by  Herman  and  Simon  deals  with  clock  failures.  We 


show  that  there  Is  only  a  constant  factor  slowdown  If  one  uses  a  self-timed 
circuit  to  simulate  a  fully  synchronous  one.  Since  self-timed  circuits  have  no 
global  clock,  and  the  result  remains  true  under  fairly  strong  modes  of  local 
clock  skew  (like  no  assumptions  on  nearby  clock  ratios),  the  rather  surprising 
result  (after  all  slow  processors  could  delay  each  other  and  such  delays  could 
cascade  Intolerably)  may  have  practical  Implications  for  clocking  disciplines 
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